Validation and improvement of automatic phonetic transcriptions
نویسندگان
چکیده
The ultimate aim of our research is to show that good-quality phonetic transcriptions of large speech corpora can be obtained by employing automatic techniques initially developed for ASR. The experiment presented in this paper has two aims. The first is to show how the quality of an automatic transcription that is easily obtained through lexicon lookup can be measured in a way that is methodologically sound. The second is to show how, while measuring the quality of an automatic transcription, it is possible to obtain information that can subsequently be used to improve the automatic transcription where necessary. As a result, correction by human transcribers should become more efficient or even superfluous.
منابع مشابه
Application-oriented validation o preliminary r
There is an increasing need for automatic procedures to generate and validate phonetic transcriptions. As the production of manual phonetic transcriptions tends to be time-consuming, error-prone and costly, procedures have been developed to derive phonetic transcriptions automatically by means of automatic speech recognition technology. Such automatic phonetic transcriptions are usually validat...
متن کاملA pplication-orien ted validation o f phonetic transcriptions: prelim inary results
There is an increasing need for automatic procedures to generate and validate phonetic transcriptions. As the production of manual phonetic transcriptions tends to be time-consuming, error-prone and costly, procedures have been developed to derive phonetic transcriptions automatically by means of automatic speech recogni tion technology. Such automatic phonetic transcrip tions are usually val...
متن کاملValidation of phonetic transcriptions based on recognition performance
In fundamental linguistic as well as in speech technology re search there is an increasing need for procedures to automat ically generate and validate phonetic transcriptions. Whereas much research has already focussed on the automatic genera tion o f phonetic transcriptions, far less attention has been paid to the validation of such transcriptions. In the little research performed in this a...
متن کاملValidation of phonetic transcriptions in the context of automatic speech recognition
Some of the speech databases and large spoken language corpora that have been collected during the last fifteen years have been (at least partly) annotated with a broad phonetic transcription. Such phonetic transcriptions are often validated in terms of their resemblance to a handcrafted reference transcription. However, there are at least two methodological issues questioning this validation m...
متن کاملTitle : Automatic Phonetic Transcription of Large Speech Corpora
Most large speech corpora are delivered with a lexicon that contains a canonical transcription of every word in the orthographic transcription. Such a lexicon can be used for generating a hypothetical ‘canonical’ phonetic transcription from the orthography. In addition, time and money permitting, some speech corpora are provided with a manually verified broad phonetic transcription of at least ...
متن کامل